Your browser doesn't support javascript.
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Añadir filtros

Base de datos
Tipo del documento
Intervalo de año
1.
medrxiv; 2024.
Preprint en Inglés | medRxiv | ID: ppzbmed-10.1101.2024.01.03.24300797

RESUMEN

IntroductionCOVID-19 can rapidly lead to severe respiratory problems and can result in an overwhelming burden on healthcare systems worldwide, making it imperative to identify high-risk patients and predict survival and need for intensive care (ICU). Most of the proposed modes are not well reported making them less reproducible and prone to high risk of bias. MethodsIn this study, the performances of seven classical machine (Random Forest (RF), Logistic Regression (LR), Support Vector Machine (SVM), k-Nearest Neighbor (KNN), XGBoost, Linear Discriminant Analysis (LDA) and Gaussian Naive Bayes (NB)) and two deep leaning models (Deep Neural Network (DNN) and Long Short-Term Memory (LSTM)) in combination with two widely used feature selection methods (random forest and extra tree classifier) were investigated to predict "last status" representing mortality, "ICU requirement", and "ventilation days". Fivefold cross-validation was used for training and validation purposes. In each fold, 80% data were used for training the models and the rest 20% were preserved for validation. To minimize bias, the training and testing sets were split maintaining similar distributions. Before splitting, k-nearest neighbour (KNN) imputation algorithm was employed to resolve the issue of missing data. On the other hand, bootstrapping technique was used for both oversampling and undersampling to address the issue of data imbalance. Publicly available 122 demographic and clinical features of 1384 patients were used. The performances of the models were evaluated using accuracy, sensitivity, specificity, and AUC (Area Under the Curve) of Receiver operating characteristic curves (ROC). ResultsOnly 10 features out of 122 were found to be useful in prediction modelling with "Acute kidney injury during hospitalization" feature being the most important one. Blood pH presents a decent discrimination capability especially in predicting "ICU requirement", and "ventilated days", Whereas gender and age are found to be vital in predicting "last status". It was observed that selecting more than 10 features lower the prediction accuracy. The performances of different algorithms depend on number of features and data pre-processing techniques. LSTM with the with balanced data and 10 features performs the best in predicting "last status" as well as "ICU requirement" with an average of 90%, 92%, 86% and 95% accuracy, sensitivity, specificity, and AUC respectively. DNN performs the best in predicting "Ventilation days" with 88% accuracy. For "ICU requirement" which is a binary prediction task, data pre-processing technique does not have any influence in making prediction and performances of different methods are comparable (89%, 98%, 78% and 95% accuracy, sensitivity, specificity, and AUC respectively). However, the number of features selected vary with data pre-processing technique. ConclusionConsidering all the factors and limitations including absence of exact time point of clinical onset, LSTM with carefully selected features can accurately predict "last status" and "ICU requirement" with approximately 90% accuracy, sensitivity, and specificity. DNN performs the best in predicting "Ventilation days". Appropriate machine learning algorithm with carefully selected features and balance data can accurately predict mortality, ICU requirement and ventilation support. Such model can be very useful in emergency and pandemic where prompt and precise decision making is crucial.


Asunto(s)
COVID-19 , Enfermedades Renales , Trastornos de la Memoria , Insuficiencia Respiratoria
2.
medrxiv; 2020.
Preprint en Inglés | medRxiv | ID: ppzbmed-10.1101.2020.10.01.20205146

RESUMEN

Early diagnosis of COVID-19 is considered the first key action to prevent spread of the virus. Currently, reverse transcription-polymerase chain reaction (RT-PCR) is considered as a gold standard point-of-care diagnostic tool. However, several limitations of RT-PCR have been identified, e.g., low sensitivity, cost, long delay in getting results and the need of a professional technician to collect samples. On the other hand, chest X-ray (CXR) is routinely used as a cost-effective diagnostic test for diagnosis and monitoring different respiratory abnormalities and is currently being used as a discriminating tool for COVID-19. However, visual assessment of CXR is not able to distinguish COVID-19 from other lung conditions. Several machine learning algorithms have been proposed to detect COVID-19 directly from CXR images with reasonably good accuracy on a data set that was randomly split into two subsets for training and test. Since these methods require a huge number of images for training, data augmentation with geometric transformation was applied to increase the number of images. It is highly likely that the images of the same patients are present in both the training and test sets resulting in higher accuracies in detection of COVID-19. It is, therefore, vital to assess the performance of COVID-19 detection algorithm on an independent data set with different degrees of the disease before being employed for clinical settings. On the other hand, machine learning techniques that depend on handcrafted features extraction and selection approaches can be trained with smaller data set. The features can also be analyzed separately for various lung conditions. Radiomics features are such kind of handcrafted features that represent heterogeneous appearance of the lung on CXR quantitatively and can be used to distinguish COVID-19 from other lung conditions. Based on this hypothesis, a machine learning based technique is proposed here that is trained on a set of suitable radiomics features (71 features) to detect COVID-19. It is found that Support Vector Machine (SVM) and Ensemble Bagging Model Trees (EBM) trained on these 71 radiomics features can distinguish between COVID-19 and other diseases with an overall sensitivity of 99.6% and 87.8% and specificity of 85% and 97% respectively. Though the performance is comparable for both methods, EBM is more robust across severity levels. Severity, in this case, was scored between 0 to 4 by two experienced radiologists for each lung segment of each CXR image represents the degree of severity of the disease. For the case of 0 severity, sensitivity and specificity of the EBM method are 91.7% and 100% respectively indicating that there are certain radiomics pattern that are not visibly distinguishable. Since the proposed method does not require any manual intervention (e.g., sample collection etc.), it can be integrated with any standard X-ray reporting system to be used as an efficient, cost-effective and rapid early diagnosis device. It can also be deployed in places where quick results of the COVID-19 test are required, e.g., airports, seaports, hospitals, health clinics, etc.


Asunto(s)
COVID-19 , Anomalías del Sistema Respiratorio
3.
medrxiv; 2020.
Preprint en Inglés | medRxiv | ID: ppzbmed-10.1101.2020.06.24.20139410

RESUMEN

Background: The lung CT images of COVID-19 patients can be characterized by three different regions: Ground Glass Opacity (GGO), consolidation and pleural effusion. GCOs have been shown to precede consolidations. Quantitative characterization of these regions using radiomics can facilitate accurate diagnosis, disease progression and response to treatment. However, according to the knowledge of the author, regional CT radiomics analysis of COVID-19 patients has not been carried out. This study aims to address these by determining the radiomics features that can characterize each of the regions separately and can distinguish the regions from each other. Methods: 44 radiomics features were generated with four quantization levels for 23 CT slice of 17 patients. Two approaches were the implemented to determine the features that can differentiate between lung regions: 1) Z-score and correlation heatmaps and 2) one way ANOVA for finding statistically significantly difference (p<0.05) between the regions. Radiomics features that show agreement for all cases (Z-score, correlation and statistical significant test) were selected as suitable features. The features were then tested on 52 CT images. Results: 10 radiomics features were found to be the most suitable among 44 features. When applied on the test images, they can differentiate between GCO, consolidation and pleural effusion successfully and the difference provided by these 10 features between three lung regions are statistically significant. Conclusion: The ten robust radiomics features can be useful in extracting quantitative data from CT lung images to characterize the disease in the patient, which in turn can help in more accurate diagnosis, staging the severity of the disease and allow the clinician to plan for more successful personalized treatment for COVID-19 patients. They can also be used for monitoring the progression of COVID-19 and response to therapy for clinical trials.


Asunto(s)
COVID-19 , Derrame Pleural
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA